A Distributed Ordered Dictionary with (1 + ) Load Balance
نویسندگان
چکیده
We consider how to maintain a distributed dictionary over a set of nodes such that each node stores all the keys in one contiguous range of the (ordered) key domain. Such range-partitioned dictionaries are commonly used in parallel databases as they enable efficient range queries. As keys are inserted and removed from the dictionary, the partitioning needs to be adjusted in order to ensure storage balance across nodes. We develop an online algorithm that ensures that the asymptotic ratio of storage load between any pair of nodes is at most (1 + ), for any constant > 0, while ensuring that the amortized cost per key insertion or deletion, measured as the number of keys that are migrated across nodes, is constant. Our algorithm can be extended to work for peer-to-peer systems where nodes themselves may join and leave the distributed dictionary.
منابع مشابه
EFL Translation Students' Perspective toward Using Bilingual Dictionary in Translation of Polysemous Words
This research presented the use of bilingual dictionary and addressed the EFL translation students' points of view on the use of bilingual dictionary in translating polysemous words (English to Persian). Moreo- ver, it aimed at finding the possible relationship between the effect of using bilingual dictionary by stu- dents in translating polysemous words and their achieved scores. In the study ...
متن کاملTask-Induced Involvement in L2 Vocabulary Learning: A Case for Listening Comprehension
The study aimed at investigating whether the retention of vocabulary acquired incidentally is dependent upon the amount of task-induced involvement. Immediate and delayed retention of twenty unfamiliar words was examined in three learning tasks( listening comprehension + group discussion, listening comprehension + dictionary checking + summary writing in L1, and listening comprehension + dictio...
متن کاملLoad Balancing in Distributed Web Server Systems with Partial Document Replication
How documents of a Web site are replicated and where they are placed among the server nodes have an important bearing on balance of load in a geographically Distributed Web Server (DWS) system. The traffic generated due to movements of documents at runtime could also affect the performance of the DWS system. In this paper, we prove that minimizing such traffic is NPhard. We propose a new docume...
متن کاملAchieving Better Load Balance in Distributed Storage System
The CPU processing speed and disk capacity are increasing tremendously during the past decade. However, the even faster increasing number of users generates higher requirements for high performance and huge capacity computer systems. More and more applications are now running on distributed systems. Load balance is an important issue in distributed storage systems. Currently, very few of them h...
متن کاملAdaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کامل